Journal article
Boosting association rule mining in large datasets via Gibbs sampling
G Qian, CR Rao, X Sun, Y Wu
Proceedings of the National Academy of Sciences of the United States of America | Published : 2016
Abstract
Current algorithms for association rule mining from transaction data are mostly deterministic and enumerative. They can be computationally intractable even for mining a dataset containing just a few hundred transaction items, if no action is taken to constrain the search space. In this paper, we develop a Gibbs-sampling-induced stochastic search procedure to randomly sample association rules from the itemset space, and perform rule mining from the reduced transaction dataset generated by the sample. Also a general rule importance measure is proposed to direct the stochastic search so that, as a result of the randomly generated association rules constituting an ergodic Markov chain, the overa..
View full abstractGrants
Awarded by National Health and Medical Research Council
Funding Acknowledgements
This work is partially supported by the Natural Sciences and Engineering Research Council of Canada. G.Q.'s research is also partly supported by the Australian National Health and Medical Research Council Project Grant APP1033452.